Parity Striping of Disk Arrays: Low-Cost Reliable Storage with Acceptable Throughput
Abstract
An analysis of mirrored discs and of RAID5 shows that mirrors have considerably better throughput, measured as requests/second on random requests of arbitrary size (up to 1MB). Mirrors have comparable or better response time for requests of reasonable size (less than 100KB). But mirrors have a 100% storage penalty: storing the data twice. Parity striping is a data layout that stripes the parity across the discs, but does not stripe the data. Parity striping has throughput almost as good as mirrors, and has cost/GB comparable to RAID5 designs, combining the advantages of both for high-traffic disc-resident data. Parity striping has additional fault containment and software benefits as well. Parity striping sacrifices the high data transfer rates of RAID designs for high throughput. It is argued that response time and throughput are preferable performance metrics.
Outline
Introduction
Why Striping and RAID Are Inappropriate for OLTP Systems
Parity Striping: Cheap Reliable Storage Plus High Throughput
An Analysis of Mirrors, RAID5, and Parity Stripe Performance
Mirrored Discs
Parity Stripe Discs
RAID5 Discs
Applying the Analysis to a 10+2 array
The Read-Only Case
The Write-Only Case
Analyzing a High-Performance Disc
Other Benefits of Parity Striping
Summary
Acknowledgments
References
Appendix: Spreadsheets Corresponding to the Graphs
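The storage trade-off described in the abstract can be illustrated with a small sketch. It is not taken from the paper: the function names are hypothetical, the parity shown is the generic byte-wise XOR used by parity-protected arrays in general, and reading "10+2" as ten discs' worth of data plus two discs' worth of redundancy is an assumption made only for the arithmetic.

```python
from functools import reduce


def xor_parity(blocks):
    """Return the byte-wise XOR of a list of equal-length blocks."""
    if not blocks:
        raise ValueError("need at least one block")
    length = len(blocks[0])
    if any(len(b) != length for b in blocks):
        raise ValueError("all blocks must have the same length")
    # zip(*blocks) yields one tuple of byte values per byte position.
    return bytes(reduce(lambda x, y: x ^ y, column) for column in zip(*blocks))


def reconstruct(surviving_blocks, parity):
    """Rebuild the single missing block from the survivors and the parity."""
    return xor_parity(list(surviving_blocks) + [parity])


def storage_overhead(data_discs, redundant_discs):
    """Redundant capacity as a fraction of usable data capacity."""
    return redundant_discs / data_discs


# Hypothetical example: three data blocks, each on a different disc.
data = [b"\x01\x02", b"\x10\x20", b"\xff\x00"]
parity = xor_parity(data)

# Losing the second block: XOR of the remaining blocks and the parity recovers it.
recovered = reconstruct([data[0], data[2]], parity)

mirror_overhead = storage_overhead(1, 1)        # every disc stored twice
ten_plus_two_overhead = storage_overhead(10, 2)  # assumed meaning of "10+2"
```

Under these assumptions a mirrored pair carries 100% overhead, matching the penalty stated in the abstract, while the assumed 10+2 arrangement carries 20%. The sketch says nothing about where the parity is placed on the discs, which is the point on which parity striping and RAID5 differ.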
Similar resources
Multi-Terabyte EIDE Disk Arrays running Linux RAID5
High-energy physics experiments are currently recording large amounts of data and in a few years will be recording prodigious quantities of data. New methods must be developed to handle this data and make analysis at universities possible. Grid Computing is one method; however, the data must be cached at the various Grid nodes. We examine some storage techniques that exploit recent developments...
Swift/RAID: A Distributed RAID System
The Swift I/O architecture is designed to provide high data rates in support of multimedia type applications in general-purpose distributed environments through the use of distributed striping. Striping techniques place sections of a single logical data space onto multiple physical devices. The original Swift prototype was designed to validate the architecture, but did not provide fault tolerance...
Self-Repairing Disk Arrays
As the prices of magnetic storage continue to decrease, the cost of replacing failed disks becomes increasingly dominated by the cost of the service call itself. We propose to eliminate these calls by building disk arrays that contain enough spare disks to operate without any human intervention during their whole lifetime. To evaluate the feasibility of this approach, we have simulated the beha...
The Zebra Striped
Zebra is a network file system that increases throughput by striping the file data across multiple servers. Rather than striping each file separately, Zebra forms all the new data from each client into a single stream, which it then stripes using an approach similar to a log-structured file system. This provides high performance for writes of small files as well as for reads and writes of large ...
Publication date: 1990